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Bacillus cereus strain Al is a newly isolated hydrogen producer capable of utilizing bioresources and biowaste, such as starch 
and starch wastewater. Here, we present a 5.67-Mb assembly of the genome sequence of strain Al , which may provide insights 
into the molecular mechanism of hydrogen production from bioresources and biowaste. 
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Hydrogen (H 2 ) has been considered one of the potential energy 
carriers for the future due to its advantages over classical hy- 
drocarbon fuels, such as high efficiency, recyclability, and nonpol- 
luting nature (1, 2). Various types of biomass have been used for 
H 2 production via biological processes (1, 3). Starch is rich in 
carbohydrates and its hydrolyzed sugar can be easily converted to 
H 2 by bacteria (4) . Corn and cassava starches are abundantly avail- 
able in China, as well as the wastewater from the starch manufac- 
turing process (4, 5). Although large numbers of organisms, in 
both pure and mixed cultures, have been widely studied for H 2 
production from biomass (1, 6-8), H 2 producers using unhydro- 
lyzed starch directly are rare (4, 9). 

Bacillus cereus strain Al is a facultative anaerobe isolated from 
the activated sludge of an anaerobic digestion reactor. Strain Al 
hydrolyzes starch by secreting amylase and fermenting glucose 
into H 2 (4, 9). In addition to glucose and unhydrolyzed starch, 
strain Al also utilizes biowastes such as starch wastewater to pro- 
duce H 2 (data not shown in detail). The end products are acetate 
and ethanol (4). Interestingly, when B. cereus strain Al was cocul- 
tured with another H 2 producer, Brevumdimonas naejangsanensis 
strain Bl, a high efficiency of H 2 production was obtained and the 
end products were mainly butyrate and acetate (4). B. cereus strain 
Al quenches 0 2 and provides substrates for H 2 production in a 
mixed culture, showing promising potential applications in the 
biohydrogen industry (1,4). 

Here, we present the draft genome sequence of B. cereus strain 
Al, obtained by using the Illumina HiSeq 2000 system, which was 
performed by the Chinese National Human Genome Center, 
Shanghai, China, with a paired-end library. The reads were assem- 
bled into 58 contigs by using VELVET (10). The genome annota- 
tion was performed by the RAST server (11). The G+C content 
was calculated by using the genome sequence. 

The draft genome sequence of strain Al comprises 5,667,342 
bases, with a G + C content of 34.9%. The genome was predicted to 
contain 5,699 coding sequences (CDSs) together with 112 RNAs. 
According to the annotation, we have predicted CDSs related to a 
pentose metabolite pathway. The transketolase/transaldolase 
pathway, instead of phosphoketolase, was predicted in the ge- 



nome, implying that strain Al might utilize pentose more effi- 
ciently. Five CDSs responsible for sucrose utilization, 10 CDSs for 
fructose utilization, and 16 CDSs for glycerol utilization, in addi- 
tion to alpha-amylase-encoding-genes, were also annotated, sug- 
gesting that strain Al may have a wide substrate spectrum. More- 
over, there are 12 CDSs related to acetoin and butanediol 
metabolism. Several CDSs responsible for organic acid formation 
were annotated, including formate, acetate, lactate, and butyrate. 
Further investigation of the pathways and key genes may help to 
reveal the mechanism of H 2 production in strain Al and to im- 
prove the production efficiency. 

Nucleotide sequence accession numbers. The whole-genome 
shotgun project has been deposited at DDBJ/EMBL/GenBank un- 
der the accession number IHOG00000000. The version described 
in this paper is the first version, JHOG01000000. 
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